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ABSTRACT 



Various representations of convolutional codes useful in analyzing 
distance properties are presented. Row distance, column distance, 
minimum distance, and free distance are defined. Known bounds on these 
distances are summarized, and where instructive, the methods of proof 
are indicated. 

A novel approach to the distance structure of a code is given in 
the form of a plot of row distance and column distance against depth 
into the code trellis. Bounds on minimum distance are applied to deter- 
mine behavior of row and column distance. 

Finally, the problem of determining the length of sequence necessary 
to produce the minimum weight codeword is considered. A bound for syste- 
matic codes is presented. This bound appears to be the tightest bound on 
this length presently known. 
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I. CONVOLUTIONAL CODING 



A. INTRODUCTION 

A general convolutional encoder is shown in Figure 1. At each 
instant of time, an information symbol is present on each of the k 
input lines, and an encoded symbol is present on each of the n output 
lines (n^k) . The encoded symbols depend not only upon the information 
symbols present at that time but also upon the information symbols 
present during the previous m units of time. A code thus described is 
called an (n,K) convolutional code and is said to be of rate R = k/n and 
to have memory m, and constraint length m + 1 blocks or (m + l)n symbols. 

In general, the symbols are elements of a finite field GF (q) . This 
paper considers the binary case of symbols in GF (2) , and all operations 
on symbols shall be assumed to be carried out in GF(2) unless otherwise 
stated. 



B. POLYNOMIAL REPRESENTATION 

Using the delay operator notation where D^ represents a delay of j 
bit-times the input sequence into the jth input line may be represented 
as a formal power series X"* (D) . 



X*J)(D) _ x (j) + x|j)D + xp)D2 + ,j = 1,2, k 



Similarly, the output sequence from the jth output line is 



Y 0) (D) = y (j) * y 0) D * y°V 
0 1 2 



* . , j — 1,2,*.., n . 
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Each output sequence is a linear combination of digits in the input 
sequences, and therefore can be represented as 



nk polynomials known as code-generating polynomials. The highest power 
of D appearing in these polynomials is m. Specifying the code-generating 
polynomials completely specifies a convoluntional code. 

Suppose it is desired to specify a (2,1) convolutional code with 
m = 3. Then 2x1 polynomials with degree 3 at most are needed. 



That is, for any given input sequence one could then determine precisely 
the output sequence on each of the two output lines. 

C. SHIFT REGISTER REPRESENTATION 

A binary convolutional code can be represented (and implemented) 
using bit-time delay elements and modulo-2 adders. Figure 2 shows the 
configuration of an encoder for the previous example. Each symbol 0 
represents a GF(2) addition. Wherever a g^) appears, it represents a 
connection if gjk) = i and no connection if gj^) = o. Since the code 
generating polynomials determine the pattern of connections, the shift 
register encoder also completely specifies a code. Specifying shift 
register connections and specifying the generator polynomials are 
completely equivalent. 
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Figure 2. Representation of shift register encoder for R=l/2, m=3 code. 



Figure 2 also shows a sampler at the output of the encoder. The 
purpose of the sampler is to combine the output symbols on all of the 
output lines into a single output sequence, Y. It does this by sequen- 
tially sampling all output lines once during each delay period. For the 
example, the output sequence of the encoder becomes 



(1) (2) (1) (2) (1) (2) 

Y = y 0 y 0 y 1 y x y 2 y 2 



where the superscript indicates the line being sampled and the subscript 
is the time interval during which the sample is taken. When using the 
polynomial representation, the output sequence of the encoder can be 
obtained by interlacing the symbols of the n output line sequences. 



D. MATRIX REPRESENTATION 

A convolutional encoder may also be described by a generator matrix. 

Lin [1] describes in detail the construction of the generator matrix for 
a general binary convolutional code; this paper is confined to a description 
of how the matrix is used and an illustration of the generator matrix for 
the continuing example. 

If an input sequence is represented as a semi -infinite vector and the 
generator matrix as then has the property 



where is the vector representation of the output sequence of the encoder. 
For the example. 






4’> 4 2) gl (1) g< 2) g<» g< 2) g' X) gf> 0 0 0 0 0 0 
0 0 gf ,?> g»> g «> g™ 000... 

0 0 0 0 g(» g W gtl) g p) gCO g(2) (1) g(2) 0 0 . 
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It is easily verified that the matrix operation gives the same result as 
using the polynomials and interlacing the output symbols. It is, in fact, 
only a different way of describing exactly the same operations. It is 
included, however, because it will give another perspective when looking 
at some of the distance properties of convolutional codes. 

E. ADDITIONAL DEFINITIONS 

A convolutional code may be either systematic or nonsystematic. A 
systematic code is one for which the first k of every n transmitted 
symbols are the k information symbols being encoded. Then each group of 
n symbols may be considered as a code "word" with k information bits and 
n - k check bits. It should be noted, however, that unlike block codes 
the check bits depend on the previous mk information bits as well as the 
information bits in the word. 

‘ In the example, the code is systematic if G^(D) = 1. In shift 
register form, the code is systematic if the first k output lines each 
are connected directly to the respective input lines and nothing else. 

In Figure 2, the code is systematic if g^^ = 1 and g^ = g^^ = g^^ 0. 

Nonsystematic codes may be catastrophic error propagating. A code is 
catastrophic if there exists an input sequence of infinite weight which 
is encoded into an output sequence which has finite weight. Massey and 
Sain [2] prove a theorem which states that a rate 1/n convolutional code 
is free of catastrophic error propagation if and only if the greatest 
common divisor of the n generator polynomials is D^ for some integer value 
of L. Rosenberg [3] develops some properties of noncatastrophic codes 
which will be helpful when discussing distance properties of noncatastrophic 
codes . 
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II. DISTANCE MEASURES 



A. COLUMN DISTANCE 

Since convolutional codes are linear codes, the minimum distance 
between output sequences is also the minimum Hamming weight of a non-zero 
output sequence. The distance between sequences is a function of the 
length of the sequences being considered. Column distance , d ^ , is defined 
as the Hamming weight of the minimum weight output sequence generated by 
all input sequences of j + 1 symbols which begin with a non-zero symbol. 
For a rate 1/n code, d^ is found by the following procedure in polynomial 
representation : 

Y (k) (D) = G (k) CD) X“ (D) , k = 1 ,2, . . . ,n 



where 

*X (D) = x n + x D + x 0 D 2 + . . . + x .dJ* , 0 . 

0 1 2 j 0 

The next step is to truncate Y^ k ^(D) by setting all coefficients of powers 
of D greater than j equal to zero. 



Y 



00 



CD) 





D + 




D 



j 



Then 



, _ minimum 

j all xCD) 



n 



iE » H (?tk)cD)) 

k=l J 



where is the Hamming weight operator. 



One may also obtain d. directly from the matrix representation. The 
method here is to form a truncated version of the semi-infinite generator 
matrix G . The truncated matrix, called G . , is a matrix consisting of the 

OO J 
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first n(j + 1) elements of the first (j + 1) rows of 
the minimum weight of a linear combination of rows (which includes the 



Then d . is 
J 



first row) of G j . For the example in the previous chapter, the form of 
a generator matrix for a rate 1/2, m = 3 code was shown. Then to find d^, 
one forms the matrix 



— 2 



(1) (2) m (2) m (2) 



o 



g 



0 



g 



8 1 



g- 



g^ 



g 



(l) (2) fl) _ (2) 



0 



g 



8 1 



§ 1 



g 



CD „(2) 



0 



0 



which contains j + 1 rows and n(j + 1) columns. Consider all possible 
linear combinations of rows which include the first row. The minimum 
Hamming weight which results is d^. 

If the above matrix had the values 



110 110 

G = 0 0 110 1 

“2 

0 0 0 0 1 1 

then d^ = 3 which is the weight of the combination of all three rows. 

The above procedure is a shorthand method of finding the minimum weight 
output sequence, since each different linear combination of rows corresponds 
to the multiplication of an input vector and the generator matrix. Since 
only the weights are important at this time, and not the actual sequences, 
it is not necessary to examine the multiplication process which would yield 
each combination of rows. 
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Before looking at cL in terms of a shift register encoder, the 
encoder must be shown as a finite state device with 2 m possible states 
depending on the contents of the shift register. A trellis diagram [4] 
is a method of displaying the possible state transition trajectories. 
Figure 3 shows a simple encoder and its trellis diagram. Each node 
corresponds to a state of the shift register, and each branch to a 
transition from one state to another. The digits labeling each branch 
are the output symbols from the encoder produced by the transition. 

Column distance, dj , is the minimum Hamming weight of a path j + 1 
branches long which begins with the transition from state 0...00 to 
state 10. . .0. 

B. ROW DISTANCE 

Row distance, r., is. the Hamming weight of the minimum weight output 
sequence which can be generated by an input sequence of j + 1 + m symbols 
which begin with a non-zero symbol and end with m zeros. In polynomial 
notation for a rate 1/n code, the expression becomes 



minimum 
all X(D) 




+ W 

H 




(D)X(d) 



1 



where 

X(D) = Xq + x^D + x^D 2 + ... + x.D- 1 , Xq^O. 

In matrix notation a truncated version of which we will call is 
formed. is composed of the first n(j + 1 + m) elements of the first 

j + 1 rows of G_a, . For the m = 3 rate 1/2 code, the truncated matrix 
for finding r^ is 
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State 

00 



10 



01 



11 



00 00 00 




Figure 3. A rate 1/2, m=2 encoder and 
associated trellis diagram. 
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As before, one examines all linear combinations of rows which include the 

first. The minimum of the weights thus obtained is r.. 

1 

If the values of the elements of the matrix above were 



V 



110110110000 

001101101100 

000011011011 



then r^ = 6 which is the weight of the first row and also the weight of the 
combination of all three rows. 

To use the trellis diagram to determine r for a code, the minimum weight 

j 

of "eligible" paths of lengths j + 1 + m branches must be found. To be 
eligible, the first branch of the path must be a transition from state 00... 00 
to state 10... 0, and the final m branches must be transitions caused by zero 
input symbols. 

Column distance and row distance are very general measures of minimum 
distance between output sequences. There is no general way to express dj 
or r in algebraic form. When attempting to evaluate d^ and r for a given 
code the difficulty grows exponentially with j. It is easily seen that the 
methods for finding d^ and r. described above all consist of determining 
the weight of 2^ possible output sequences and then choosing the minimum. 
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' As j becomes large, this method quickly becomes impractical even with the 
use of a digital computer. More specific distance measures are used when 
attempting to judge the relative "goodness" of convolutional codes for use 
with particular decoding algorithms. The two most common and important of 
these are defined below. This paper will later describe how these distances 
are bounded . 



C. MINIMUM DISTANCE AND FREE DISTANCE 

Minimum distance, d . , refers to the minimum distance between initial 

min 

code words. An initial code word is an output sequence of length n(m + 1) 
symbols generated when the shift register is initially in the zero state. 
This says that d m i n = dj for j = m. Minimum distance is an important 
measure for a code, when a decoding algorithm is used which bases decisions 
on encoded sequences one contraint length long. One such algorithm is 
Threshold Decoding [5] . 

For decoding algorithms which make decisions based on much longer 
sequences such as Viterbi [6] decoding and sequential decoding, a reasonable 
measure of error correcting ability is free distance. Free distance is 
defined as 

j limit 



free j-*o° a j 



d.: . 



It will be shown later that this limit exists. Since for reasons mentioned, 
this is difficult to obtain for a code in general, many theorists have 
attempted to find an upper limit L such that dj for j = L gives free dis- 
tance for a code. This bound on computation required to find df ree is 
described later. 
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III. BOUNDS ON MINIMUM DISTANCE 



A. GILBERT LOWER BOUND 

Massey [5] proved a theorem which provides a lower bound on minimum 
distance for systematic convolutional codes which is equivalent to the 
Gilbert bound on minimum distance for block codes [7]. The following is 
a statement of Massey's theorem for a binary code, and a description of 
the method of his proof. 

THEOREM: Given a rate R = k/n and a constraint length n^ = n(m + 1), 
then there exists at least one convolutional code with a minimum distance 
d, where d is the largest integer for which 



METHOD OF PROOF : A convolutional code has minimum distance d if it has no 
initial code word of weight d - 1 or less for which some first information 
symbol is non-zero. Massey computes the total number of non-zero initial 



distinct codes in which each initial code word can appear. If this product 
is less than the total number of distinct convolutional codes, then a code 
must exist which has minimum distance d. The algebraic expression of this 
inequality is 




d-1 



code words which have wieght d - 1 or less and multiplies by the number of 




IS 



which reduces to 



d-1 



\ N.(l-R) 

; A r 2 



j=i 



Then, using the well known inequality 

fcn 



„ \ nH(S) 

* £ 2 , *41/2 



j=0 



the bound may be expressed as 



H( — ) < 1 - R 



n A 



where 



H(x) = -x log 2 x - (1-x) log 2 (1-x) 



Although Massey proves the bound for systematic codes, it is not 
difficult to extend the proof to include non-systematic codes, where 
"non-systematic" is used in the sense which excludes systematic codes, 
rather than in the sense which includes systematic codes as a special 
case. For systematic codes, choosing an initial code word specifies the 
information sequence, and the proof proceeds to deal with the (n - k)k 

parity bit polynomials. For a non-systematic code one need only specify 

2 

the first k of the required nk polynomials. Then an initial code word is 

2 

assumed. This word and the k polynomials uniquely determine the input 
polynomials. The proof then proceeds in the same fashion as for systematic 
codes except each side of the inequality must be multiplied by 2^ + ^ 

to allow for the number of ways the first k 4 polynomials may be specified. 
Since this factor appears on both sides of the inequality, the final 
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expression for the Gilbert bound remains the same. Thus if the Gilbert 
bound inequality is satisfied, it guarantees the existence of both a 
systematic and a non-systematic convolutional code with minimum distance 
of at least d. This bound only guarantees the existence of such codes, 
but does not reveal a procedure for finding them. 

B. PLOTKIN UPPER BOUND 

Massey [8] has also derived an upper bound on minimum distance for 
convolutional codes in a manner similar to the Plotkin bound for block 
codes. This section presents the form of the bound and omits the method 
of proof. Described later is the method of proof for another upper bound 
which yields a tighter bound for systematic codes, and the same bound for 
non-systematic codes as Massey's version. The Plotkin bound states that 
the minimum distance of a binary convolutional code satisfies the 
inequality 



C. MCELIECE-RUMSEY UPPER BOUND 

McEliece and Rumsey [9] derived an upper bound on minimum distance 
for systematic convolutional codes of rate 1/n. The statement of this 
bound is as follows: 




where R = k/n. For a rate 1/n code, this inequality reduces to 



d . 

min < 



I • 
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THEOREM: For each integer h il , 



d . ^ 

min — 



2 h -l 




m(n-l) ^ 



METHOD OF PROOF: McEliece and Rumsey demonstrate that a non-zero output 

polynomial generated by a non- zero input of length h is at least as likely 
to have a zero for the 1 1 coefficient as it is to have a one for its 1 
coefficient. The right-hand side of the inequality thus is an expression 
for the average weight of a non- zero code word caused by an input of length 
h, and the inequality results from the fact that the minimum weight can be 
no greater than the average weight. A corollary is presented which states 
if 



(n-1) (m+1) 

1 + log 2 [(n-1 (m + 1)] 

then 



d min < (n-lKm+n . " ^ f (l ,. 1)(B .„] . j 

Asymptotically this can be stated as 



limit 

m-*<» 



^min < j_ 

n. 2 

A 



(1-R) 



Costello [10] has generalized the McEliece-Rumsey bound to include all 
fixed (non-time varying) convolutional codes. The asymptotic form of 
this general bound is 



limit 

co 



d - 

mm < 1_ 
n A 2 • 
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D . SUMMARY 



Figure 4 shows plots of the asymptotic forms of the bounds described 
in this chapter [10] . The bounds are asymptotic because m is assumed to 
be arbitrarily large. It must be remembered that for a given rate, at least 
one code exists which has a ratio d/n equal to or greater than the Gilbert 

A. 

bound. No code exists which has a greater d/n^ ratio than indicated by the 
appropriate upper bound. Nothing is known about how to construct codes 
which are guaranteed to be in this region. For smaller values of m, on 
the order of m = 20, codes have been formed [11] which have minimum distances 
at or near the appropriate upper bound. 
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Plotkin Bound (systematic) 
Gilbert Bound 



Figure 4. Asymptotic bounds on minimum distance. 



IV. BOUNDS ON FREE DISTANCE 



A. PROPERTIES OF d AND r . 

J 1 

It is necessary to establish some properties of column distance and 
row distance which will be used when discussing bounds on free distance. 



Property 1 : r 0 = weight of the generator £ n^ 

0 d = n 
o 

An input sequence consisting of a one followed by all zeros generates 

the output sequence which has weight r . It is obvious that this weight 

o 

is exactly equal to the total weight of the generator polynomials and that 
this total weight cannot exceed (m + l)n. The weight of the first branch 
from the all-zero path is d Q , and since any single branch has weight no 
greater than n, d Q ^ n. Since d Q is maximum when all generators start 
with one, practical codes are universally chosen so d Q = n. 



Property 2: d^ ?= d + j = 0,1,2,... 

The general form of the truncated matrix G . 



+ 



1 



is 



G. 
~J + 



1 




00 * * *00 



x 



x 



x 



where each of the elements labeled x may be zero or non-zero depending upon 
the particular code. Since all elements below the submatrix G_j are zero, 
it is easily seen that any linear combination of rows of G . + i can have 
weight no less than the corresponding rows of G . alone. Thus d^ is 
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non-decreasing in j . It should be recalled that these linear combinations 
are being used to determine weights of sequences which would actually be 
produced by multiplication of an input vector with the generator matrix. 



Property 3: r. > r. + ^ j * 0, 1, 2,... 

The general form of G_j + ^ is 




G . 
~3 



0 • • • 0 

0 - • • 0 



00- * * xx xx 

where, as above, each of the elements labeled x may be zero or non-zero. It 

can be seen that if any linear combination of rows of G. . which include the 

~3 +l 

row outside of G j have weight greater than a combination of rows inside 
G^, then that combination is not minimum weight and is not used in determining 
r j + 1* Therefore, rj is non-increasing in j. 



Property 4: r^. > d_. , j = 0, 1 , 2, 



Another form for is 



G . = 
“J 



G. 

“3 



x ... x 



It is seen that no linear combination of rows of (f. can have weight less 

than the combination of the same rows of G . . Therefore, r- ^ d. for all j 

~3 3 j 



Property 5: The limits r 



limit . 

r . and 



d = limit j. exist, and r > d, 

CO • ^ ^ i 9 co=&c 

J-> CD J 
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Properties 1 through 3 indicate that { ] is a non-decreasing integer 

valued sequence bounded from above and { r j 1 is a non-increasing 
integer valued sequence bounded from below. Thus the sequences approach 
limits. From Property 4, r^ 5 d^ 

Since d^ and r j are integer valued, the sequences achieve their 
limiting values for some finite L and L such that d^ = d. and 



00 



= . Costello has shown that for non-catastrophic codes, r^ = d c 



Note that this does not imply L c = . 

It might be well to consider the properties of r^ and d^ in terms of 
a trellis diagram such as that in Figure 3. Column distance, dj , is the 
weight of the minimum weight path through the trellis which is j + 1 
branches long and has a 00... 00 to 10... 00 transition as its first branch. 
It can be seen that each- path of j + 1 branches has a path of j branches 
as a prefix, so the minimum weight of the j + 1 branches must be at least 
as great as the minimum weight of a j branch path. Thus d_. is a non- 
decreasing on j . 

Row distance, r., is the weight of the minimum weight path through the 
trellis which is j + 1 + m branches long, begins with a 00... 00 to 10... 00 
transition, and ends at a 00... 000 state. One possible path for r^ is the 
path which yielded r^ ^ plus a 00... 00 to 00... 00 transition. Thus r^. 



can never be greater than r 



j ~ 1 * 



Since all initially non-zero paths j + 1 + m branches long ending at 
the zero state contain as prefixes all initially non- zero paths j + 1 
branches long, r. can never be less than d.. 
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B. LOWER BOUNDS ON FREE DISTANCE 



The properties shown in the previous section can be stated as 

properties of various bounds. An upper bound on r^ for any j is an 

upper bound on d^ for all j. A lower bound on d^ for j = L is a lower 

bound on d. for j > L. It follows from this last statement that Gilbert 
J 

lower bound for minimum distance is also a lower bound on free distances. 
•'Neumann [12] has given a bound for non-systematic codes which states 



that there is at least one binary non- 


-systematic 


code such that 


for large 


values of m 










^free > 


2H 1 (1-R) 


for R 


0.37 




n A = 


2R(1-2 2R - 1 ) 


for R 


0.37 




H(1 


-2 2R_1 ) +2R-1 








A sketch of this bound is 


included in 


Figure 5. 


An interesting 


thing to 



note is that since the Neumann bound is above the upper bound for systematic 
codes for some range of R, then in that range of R there exists some non- 
systematic code with larger free distance than any systematic code. 

C. UPPER BOUND ON FREE DISTANCE 

An upper bound on minimum distance is not, in general, an upper bound 
on free distance. However, the McEliece-Rumsey bound, due to its method 
of proof, is actually a bound on r ^ and therefore a bound of d^ for all j. 
This could be expressed as 




2 j 

2^-1 



jn_ m(n-l) 
2 2 



/ 




McEliece-Rumsey Bound (nonsystematic) 



McEliece-Rumsey Bound (systematic) 

Gilbert Bound 

— — Neumann Bound (non-systematic) 



Figure 5. Asymptotic bounds on free distance. 
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The corollary used as a bound on column distance corresponds to the 
minimum of the right-hand side of the above inequality. The bound on 
Tj is only of interest in the region where it is decreasing, and as a 
bound on d. } of concern only at a minimum point. 



D. PERMISSIBLE REGIONS FOR d . AND r . WHEN R = T 

3 3 2 

Massey f s Plotkin bound on d . may be used to bound d. for j S m. 

J mm J 3 

This bound is the upper bound on the minimum weight of a code word m + 1 

branches long produced by an encoder with m delay elements. Since every 

code sequence shorter than m + 1 branches is a prefix of an m + 1 branch 

length sequence, Massey 1 s proof is valid while considering the first j 
/ 

delay elements. Thus 



d. < 3 + it!- , j = m, R = y 

J 2 2 



Savage [13] has shown that there exist codes for which d^ is lower bounded 
by a line from the origin to the Gilbert bound at j = m. When these bounds 
are plotted together with the McEliece-Rumsey bound, the resulting permissible 
region for {^j~\ is as indicated by the shaded region of Figure 6. 

Figure 7 shows typical forms of £r.j and [d_.^j for a non-catastrophic 
code. The rates of change of r_. and d^ are not known in general, do not 
approach a limit at the same rate, and may not change uniformly with j. 
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Figure 6. Permissible region for d-j . (Rate = y ) 




d. 

3 



r . 
3 



Figure 7. 



Typical behavior of r 



3 



and dj . 
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V. BOUNDS ON LENGTH OF OUTPUT SEQUENCE 



PRODUCING FREE DISTANCE 



A. INTRODUCTION 

Even after bounding free distance, the problem of determining the 
length of the output sequence which will yield d^ remains. This is an 

important parameter because between two codes with equal free distance, 
the code for which the smaller value of j yields d^ = d^ ree has the better 
error correcting capability. Also, if it is desired to compute the free 
distance of a particular code, a bound on the length of paths to be searched 
is a valuable measure of the maximum amount of computation which will be 
required. 

It is necessary to state explicitly some properties of non-catastrophic 

rate 1/n codes which will be used in the discussion. First, it is apparent 

that r is equal to the fhmming weight of the generators. That is, 
o 

r 0 = W H [ g(1) (°)] + W H [ g(2) M + -..+W h [g (i °(D)] . 

Since the weight of the generators is at most n(m + 1), 



r Q ^ n (m + 1) . 

It should be recalled from the properties previously stated that r is an 

o 

upper bound on d^ and r for all j. The second important property is the 
fact that a path through a trellis which yields r^ cannot leave the all-zero 
path once a zero state has been reached. Examination of the trellis shows 
any non-zero path which begins and ends at the all-zero state cannot touch 
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and depart from an all-zero node and still be a minimum weight path 
because any path departing from the all -zero path has weight at least 
as great as the all-zero path. Since m consecutive zeros in the input 
sequence drive the encoder to the all-zero state, the result is that any 
input sequence which determines the path yielding r ^ cannot have an 
internal sequence of m consecutive zeros. 

B. COSTELLO'S BOUND 

Costello uses the fact that an input sequence with no more than 

m - 1 consecutive zeros cannot produce an output with more than (2m - 2) 

consecutive zero-weight output branches for a rate — code. Then at 

least one non-zero element must be produced in every (2m - 1) encoded 

branches. Therefore at least 2(m + 1) ones must be produced in an output 

sequence of (2m - l)2(m + 1) + 1 branches. Since this is the upper limit 

of r and therefore d r , then 
o free 

dfree ” r 4m^ + 2m-l . 

Costello states that a similar argument can be used to show that for a 
systematic rate k/n code, 

d = r 

free (n - k) (m + l)m. 



C. AN IMPROVED BOUND 
1 

For rate y codes a better bound than Costell's can be derived by using 

tighter constraints. For the upper limit of d , use the McEliece-Rumsey 

free 



bound rather than the maximum weight of the generator. This bound for a 

1 



non-systematic rate — code is 



dfree < m + lo g 2 (m) + 1 
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A theorem by Odenwalder states that for a non-catastrophic rate 1/n code, 
the maximum number of consecutive zero weight branches which may occur is 
m - 1. This yields 

^free r jm + log^(m - 1) + 1 (m - 1) + 1 . 

While this is an improvement over Costello's bound, it is still on the 
order of m^. This bound was also derived by Bahl and Jelinek [14]. 



D. AN IMPROVED BOUND FOR RATE 1/n SYSTEMATIC CODES 

Figure 8 is a state diagram introduced by Viterbi [15] . The 0_ node is 
split so entrance into the diagram is from the zero node and exit from the 
diagram is the zero node. A path through the diagram represents a departure 
from and return to the all-zero path in the trellis. A broken line repre- 
sents a branch caused by a non- zero input and a solid line is a branch 
caused by a zero input. Since the code is assumed systematic, branches 
caused by a non-zero input will have a weight of at least one. 

Any path which yields r^ is a path through the state diagram. Since 
Tj is a minimum weight, the path must be non-looping. If an integer L is 
found such that the minimum weight of a non-looping path L branches long 
through the state diagram is at least as great as an upper bound on free 
distance, then 



and 



r 

L 



= r 



CD 



r = d 
L free 

The path whose weight grows most slowly with length is generated by an 
input sequence consisting of a one followed by the maximum permitted number 
of zeros followed by a one followed by the maximum permitted number of 
zeros, etc. The length of the zero sequences will be limited by the 



34 



00 




Figure 8. State diagram for m = 2 code. 




Figure 9. Tree of zero induced state transitions. 
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requirement that the path be non-looping, or the path shall not touch any 
node more than once. Minimum weight is assured by assuming a weight of 
one on branches caused by a non- zero input and a weight of zero on a branch 
caused by a zero input. 

Figure 9 is a tree representation of all transitions caused by zero 
inputs. In general, the tree is m branches deep. A path of m zeros is 
reserved for the last m input symbols necessary to generate a total path 
j + 1 + m branches long used to define r.. Since any path of m zeros must 
touch the node to the left of the 0^ node, a sequence of m - 1 consecutive 
zeros is not permitted. If it were, a node would be touched twice. Thus 
the first non-zero input may be followed by a sequence of at most m - 2 
zeros. If nodes touched by such a sequence are removed from the tree, it is 
seen that two sequences of m - 3 zeros are permitted. Removing nodes touched 
by these sequences leaves four sequences of m - 4 zeros. Thus the first m - 1 
output branches must have weight at least one, each of the next two m - 2 

output branches must have weight at least one, and so on. Thus an output 

sequence is determined which has the following branch length: 

L = m - 1 + 2(m - 2) + 4 (m - 3) +8 (m - 4) + ... 

The weight of this sequence must be at least 

W - 1 + 2+4+8+ .... 

H 

This can be put in the form of a summation and set greater than or equal to 
the McEliece-Rumsey bound. 

[ 2 ] + Q Z -for 1 ) foA 1 - ) + R iog 2 [ (n-1) (m+1)] + 1 

i=o 
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s 






where Q < • If the smallest integers which satisfy these relationships 

are called P* and Q* , then 

p* 

[ 2 1 (m-i-1 

i=o 



)] 



+ Q* (m-p*-2) 




It follows that 



d free r L* + 1 



For n = 2, and m = 31, L* = 598 and d_ = r ^ 

’ free 599. 



The Bahl-Jelinek bound reduced for systematic codes yields 



d free 660 • 

Costello's bound for a systematic code yields 

d free = r 992 • 

It is seen that the bound presented above is tighter than the Bahl-Jelinek 
bound, but the order of L* is still m2 , 

Miczo and Rudolph [16] examined the problem of bounding L* such that 
df ree = d^* and demonstrated by counter-example that L* cannot be any 
constant multiple of m. They speculate that a bound of the form (m log m) 
might be placed on L*, but this has not been done. 
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VI. SUMMARY AND RECOMMENDATIONS 



The various distance measures of convolutional codes have been 
defined, and the upper and lower bounds have been demonstrated. The 
bounds are based on the general structure of convolutional codes without 
considering the effect of specific code generating polynomials. In 
general, specifying generators does not provide a means of calculating 
distances since so little is known about the Hamming weight of products 
of polynomials. A logical extension of the topics in this paper is the 
study of codes whose generators have special properties. An examination 
of rate 1/2 complimentary codes by Bahl and Jelinek [11] is an example of 
such a study. 

The bounds on the length of sequences which yield df ree appear to be 
subject to more tightening. The bound at the end of the previous chapter 
considered each branch to be of the minimum possible weight. An attempt 
to assign a more realistic distribution of weights to the branches of the 
state diagram might result in a tighter bound. 

As mentioned earlier, the growth of dj with j is an important property 
of codes, and narrowing the permissible region of d. would be significant. 
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